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We consider large-scale service systems with multiple customer 
classes and multiple server pools; interarrival and service times are 
exponentially distributed, and mean service times depend both on 
the customer class and server pool. It is assumed that the allowed 
activities (routing choices) form a tree (in the graph with vertices 
being both customer classes and server pools) . We study the behavior 
of the system under a Leaf Activity Priority (LAP) policy, which 
assigns static priorities to the activities in the order of sequential 
"elimination" of the tree leaves. 

We consider the scaling limit of the system as the arrival rate 
of customers and number of servers in each pool tend to infinity in 
proportion to a scaling parameter r, while the overall system load 
remains strictly subcritical. Indexing the systems by parameter r, 
we show that (a) the system under LAP discipline is stochastically 
stable for all sufficiently large r and (b) the family of the invariant 
distributions is tight on scales ri +t for all e > 0. (More precisely, the 
sequence of invariant distributions, centered at the equilibrium point 
and scaled down by r~^ +e ' , is tight.) 

1. Introduction. Large-scale service systems with heterogeneous cus- 
tomer and server populations bring up the need for efficient dynamic control 
policies that dynamically match arriving (or waiting) customers and avail- 
able servers. It is desirable to have algorithms that avoid excessive customer 
waiting and do not rely on the knowledge of system parameters. 

Consider a service system with multiple customer and server types, where 
the arrival rate of class i customers is Aj, the service rate of a class i customer 
by a type j server is fiij , and the server pool sizes are Bj . A desirable feature 
of a dynamic control is insensitivity to parameters Aj and fUj. That is, 
the assignment of customers to server pools should, to the maximal degree 
possible, depend only on the current system state (server occupancies, queue 
sizes), and not on prior knowledge of arrival rates or mean service times, 
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because those parameters may not be known in advance and, moreover, 
they may be changing in time. 

If the system objective is to minimize the largest average load of any 
server pool, a "static" optimal control can be obtained by solving a linear 
program, called static planning problem (SPP), which has Bj's, /Xy's and 
Aj's as parameters. An optimal solution to the SPP will prescribe optimal 
average rates Ajj at which arriving customers should be routed to the server 
pools. Typically (in a certain sense) the solution to SPP is unique and the 
basic activities, i.e. routing choices (ij) for which Ajj > 0, form a tree; let 
us assume this is the case. Probabilistic routing with static probabilities 
Ajj/Ai of routing a customer of type i to server pool j, will balance the 
loads among different server pools and will avoid excessive customer waiting; 
however, in order to find the routing probabilities, it is necessary to know all 
of the parameters Aj, Bj, and /ijj in advance. The Shadow Routing policy in 
[6] is a dynamic control policy, which achieves the load balancing objective 
without a priori knowledge of input rates Aj; in the process it "automatically 
identifies" the basic activity tree. Shadow Routing policy, however, does need 
to "know" the service rates fiij. 

In this paper we assume that the basic activity tree is known, but not 
the precise rates Aj, we restrict the routing choices to activities within 
the basic activity tree. We consider the large-number-of-servers asymptotic 
regime, in which the arrival rate of customers and number of servers in each 
pool tend to infinity in proportion to a scaling parameter r; our focus is 
on the case where the overall system load remains strictly subcritical. In 
a previous paper [8] we showed that a very natural load balancing policy 
considered e.g. by [4], [1], [2] may lead to instability at the system equilib- 
rium point: in particular, for certain parameter settings [8, Theorem 7.2] 
demonstrated the non-tightness (in fact - evanescence to infinity) of invari- 
ant measures on the diffusion, ra- scale. (More precisely, this means that 
the sequence of invariant distributions, centered at the equilibrium point 
and scaled down by r _ 2 ; is non-tight, and moreover - escapes to infinity.) 

In this paper we consider a different algorithm, which we call the Leaf 
Activity Priority (LAP) policy. As specified above, no precise knowledge of 
the rates Aj and fiij is required, besides the knowledge of the basic activity 
tree, and routing is restricted to basic activities only. The policy assigns 
static priorities to the activities in the order of sequential "elimination" of 
the tree leaves. The precise definition will be given in Section 2.2. Assum- 
ing strictly subcritical load, for this policy we first prove that the system 
is stochastically stable for all sufficiently large values of r. (In contrast to 
load balancing policies, the stability under LAP is not "automatic".) Next, 
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we demonstrate the r-scale (fluid-scale) tightness of stationary distributions; 
this fact is closely related to stability - both are "consequences" of the rela- 
tively "benign" behavior of the system on the fluid scale. Then, we obtain a 
much stronger tightness result, namely that the invariant distributions are 
tight on the r5 +e -scale, for any e > 0; this is the main contribution of the 
paper, which involves the analysis of the process under hydrodynamic and 
local-fluid scaling (in addition to "standard" fluid scaling). We believe that 
our analysis can be extended to prove still stronger, diffusion scale (r 1 / 2 ) 
tightness; this is work in progress. 

For a general review of literature on the large-number-of-servers asymp- 
totic regime, including design and analysis of efficient control algorithms, 
see e.g. [4, 6] and references therein. 

The rest of the paper is organized as follows. In Section 2 the model, the 
asymptotic regime, LAP discipline and basic notation and introduced. The 
main results are stated in Theorem 10 of Section 3, with its statements (i) 
and (ii) being the stability and tightness results, respectively. Section 4 con- 
tains the analysis of the process on the fluid scale, which leads to establishing 
stability (Theorem 10(i)) and fluid scale (r-scale) tightness of stationary dis- 
tributions. In Section 5, using the fluid-scale tightness as a starting point, we 
prove the r 1 / 2+e -scale tightness (Theorem 10(h)); this is the key part of the 
paper, which involves the analysis of system dynamics under LAP discipline 
under hydrodynamic and local-fluid scaling. 

2. Model. 

2.1. The model; Static Planning (LP) Problem. Consider the model in 
which there are I customer classes, labeled 1,2,...,/, and J server pools, 
labeled 1, 2, . . . , J. (Servers within pool j are referred to as class j servers. 
Also, throughout this paper the terms "class" and "type" are used inter- 
changeably.) The sets of customer classes and server pools will be denoted 
by I and respectively. We will use the indices i, i' to refer to customer 
classes, and j, j' to refer to server pools. 

We are interested in the scaling properties of the system as it grows large. 
The meaning of "grows large" is as follows. We consider a sequence of sys- 
tems indexed by a scaling parameter r. As r grows, the arrival rates and the 
sizes of the service pools, but not the speed of service, increase. Specifically, 
in the rth system, customers of type i enter the system as a Poisson process 
of rate \\ = rAj, while the jth server pool has r/3j individual servers. (All 
Aj and (3j are positive parameters.) Customers may be accepted for service 
immediately upon arrival, or enter a queue; there is a separate queue for 
each customer type. Customers do not abandon the system. When a cus- 
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tomer of type i is accepted for service by a server in pool j, the service time 
is exponential of rate p,ij\ the service rate depends both on the customer 
type and the server type, but not on the scaling parameter r. If customers 
of type % cannot be served by servers of class j, the service rate is pij = 0. 

Remark 1. Strictly speaking, the quantity (3jr may not be an integer, 
so we should define the number of servers in pool j as, say, [(3jr\ . However, 
the change is not substantial, and will only unnecessarily complicate the 
notation. 

Consider the following, load-balancing, static planning problem (SPP): 
(la) minp, 

subject to 

(lb) \j >0, Vi,j 

(lc) ^Ajj = Ai, Mi 

j 

(Id) J>,- i )>,)- :/>. Vj. 

i 

Throughout this paper we will always make the following two assumptions 
about the solution to the SPP (1): 

Assumption 2 (Complete resource pooling). The SPP (1) has a unique 
optimal solution {Xij, i € I, j £ J}, p. Define the basic activities to be the 
pairs, or edges, (ij) for which > 0. Let £ be the set of basic activities. 
We further assume that the unique optimal solution is such that £ forms a 
tree in the (undirected) graph with vertices set X U J . 

Assumption 3 (Underload). The optimal solution to (1) has p < 1. 

Remark 4. Assumption 2 is the complete resource pooling (CRP) con- 
dition, which holds "generically" in a certain sense; see [7, Theorem 2.2]. 
Assumption 3 is essential for the main results of the paper (r 2 +<E -scale tight- 
ness), but many of the auxiliary results hold (along with their proofs) for 
the critically loaded case p = 1. 



STEADY-STATE TIGHTNESS, PRIORITY DISCIPLINE 



5 



Note that under the CRP condition, all ("server pool capacity") con- 
straints (Id) are binding: J2i ^ij / (PjPij) = Pi Yj. This in particular means 
that the optimal solution to SPP is such that, if a system with parameter r 
will route type i customers to pool j at the rate A^r, the server pool average 
loads will be minimized and "perfectly balanced" . 

In this paper, we assume that the basic activity tree is known in advance, 
and restrict our attention to the basic activities only. Namely, we assume that 
a type i customer service in pool j is allowed only if (ij) G £■ (Equivalently, 
we can a priori assume that £ is the set of all possible activities, i.e. pij = 
when (ij) G" £, and £ is a tree. In this case CRP requires that all feasible 
activities are basic.) For a customer type i, let S(i) = {j : (ij) G £}; for a 
server type j, let C(j) = {i : (ij) G £}. 

Under the CRP condition, optimal dual variables Vi, i G X, and ay, j G J , 
corresponding to constraints (lc) and (Id), respectively, are unique and all 
strictly positive. The dual variable is interpreted as the "workload" asso- 
ciated with one type i customer, and is interpreted as the (average) rate 
at which one server in pool j processes workload when it is busy, regardless 
of the customer type on which it is working, as long as i G C(j). The dual 
variables satisfy the relations = ctj//3j for any (ij) G £, and . aj = 1, 
which in particular imply that 

(2) Yl v ^ = 22 ^ = 22 Xi > = Yl a >Yl j^t. = p- 

i i j i j ^ l 3 j i ^ 

Given p < 1, this means, for example, that when all servers in the system are 
busy, the total rate Y2i v 0^i r a t which new workload arrives in the system 
is strictly less than the rate ^ ■ a^r = r at which it is served. 

Remark 5. Although (1) is the load-balancing SPP, and the notions 
introduced in this subsection are defined in terms of this SPP, the policy 
we consider in this paper (defined in Section 2.2) is not a load balancing 
policy. In particular, the system equilibrium point under the policy, will not 
balance server pool loads, but rather will keep all pools, except one, fully 
occupied. 

2.2. Leaf activity priority (LAP) policy. For the rest of the paper, we 
analyze the performance of the following policy, which we call leaf activity 
priority (LAP). The first step in its definition is the assignment of priorities 
to customer classes and activities. 

Consider the basic activity tree, and assign priorities to the edges as fol- 
lows. First, we assign priorities to customer classes by iterating the following 
procedure: 
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1. Pick a leaf of the tree; 

2. If it is a customer class (rather than a server class), assign to it the 
highest priority that hasn't yet been assigned; 

3. Remove the leaf from the tree. 

Without loss of generality, we assume the customer classes are numbered 
in order of priority (with 1 being highest). We now assign priorities to the 
edges of the basic activity tree by iterating the following procedure: 

1. Pick the highest-priority customer class; 

2. If this customer class is a leaf, pick the edge going out of it, assign this 
edge the highest priority that hasn't yet been assigned, and remove 
the edge together with the customer class; 

3. If this customer class is not a leaf, then pick any edge from it to a 
server class leaf (such necessarily exists), assign to this edge the highest 
priority that hasn't yet been assigned, and remove the edge. 

It is not hard to verify that this algorithm will successfully assign priorities 
to all edges; it suffices to check that at any time the highest remaining 
priority customer class will have at most one outgoing edge to a non-leaf 
server class. 

Remark 6. This algorithm does not produce a unique assignment of 
priorities, neither for the customer classes nor for the activities, because 
there may be multiple options for picking a next leaf or edge to remove, 
in the corresponding procedures. This is not a problem, because our results 
hold for any such assignment. Different priority assignments may correspond 
to different equilibrium points (defined below in Section 2.3); once we have 
picked a particular priority assignment, there is a (unique) corresponding 
equilibrium point, and we will be showing steady-state tightness around 
that point. Furthermore, the flexibility in assigning priorities may be a useful 
feature in practice. For example, it is easy to specialize the above priority 
assignment procedure so that the lowest priority is given to any a priori 
picked activity. 

We illustrate one such priority assignment in Figure 1. 

We will write (ij) < (i'f) to mean that activity (ij) has higher priority 
than activity (i'j')- It follows from the priority assignment algorithm that 
i < i! (customer class i has higher priority than i') implies (ij) < (i'j')- In 
particular, if j = j', we have (ij) < (i'j) if and only if i < i' . Without loss of 
generality, we shall assume that the server classes are numbered so that the 
lowest-priority activity is (IJ). (In Figure 1, this corresponds to assigning 
the number 3 to server pool C.) 
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Fig 1. An example assignment of priorities to customer classes and activities to an ex- 
ample network. Circles represent customer classes, squares represent server pools. 

Now we define the LAP policy itself. It consists of two parts: routing and 
scheduling. "Routing" determines where an arriving customer goes if it sees 
available servers of several different types. "Scheduling" determines which 
waiting customer a server picks if it sees customers of several different types 
waiting in queue. 

Routing: An arriving customer of type i picks an unoccupied server in 
the pool j G S(i) such that (ij) < (ij') for all f G S(i) with idle servers. If 
no server pools in S(i) have idle servers, the customer queues. 

Scheduling: A server of type j upon completing a service picks the cus- 
tomer from the queue of type i E C(j) such that i < i' for all i! G S(i) with 
Qi' > 0. If no customer types in C(j) have queues, the server remains idle. 

We introduce the following notation (for the system with scaling param- 
eter r): 

i&ij (t) , the number of servers of type j serving customers of type i at time 
t; 

Qi(t), the number of customers of type i waiting for service at time t. 

2.3. LAP equilibrium point. Informally speaking, the equilibrium point 
(ip*j,q*)(ij)<zE,iei i s the desired operating point for the (fluid scaled) vec- 
tor (vE^/r, Q[ /V)(r/')e£,je:r of occupancies and queue lengths under the LAP 
policy. Specifically, we will be showing that in steady state the fluid-scaled 
vector converges in distribution to the equilibrium point, and will then show 
that the deviations from it are small. We define the equilibrium point below; 
it will be the stationary point of the fluid models defined in Section 4. 

The LAP discipline is not designed with load balancing in mind, so its 
equilibrium point does not, of course, achieve load balancing among the 
server pools. To define it, we recursively define the quantities Ay > 0, which 
have the meaning of routing rates, scaled down by factor 1/r. (These Ajj are 
not the same as those given by the optimal solution to the SPP (1).) For the 
activity (lj) with the highest priority, define either Ay = Ai and -i/'ij = ^7, 
or = (3j and Aij = Pjfiij, according to whichever is smaller. Replace Ai 
by Ai — Ay and f3j by f3j — ^* •, and remove the edge (lj) from the tree. We 
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now proceed similarly with the remaining activities. 
Formally, set 

Xij = min I A; - V> /% f Pi ~ Y 177 ) ) ' 

\ f--(ij')<(ij) V i'Ki^'^JJ 

Since the definition is in terms of higher-priority activities, this defines the 
(Ajj)(jj) e £ uniquely. The LAP equilibrium point is defined to be the vector 

(V , ij>Q , i *)(ij)e£',iex 

given by 

(3) = — , g?=OforaU(y)€f,i€X. 

(Since we're in the underloaded case p < 1, all queues should be at equi- 
librium.) 

To avoid trivial complications, throughout the paper we make the follow- 
ing assumption: 

Assumption 7. If (VfyO^es are sucn that t/jij > 0, Aj = J2j fHjipij, 
and < /3j for all j, then ^ > for all (ij) G £. In particular, 

the equilibrium point satisfies this condition and, moreover, it is such that 
Ei ^ = & for all j < J and £\ V*/ < A/- 

The assumption means that the system needs to employ (on average) 
all activities in order to be able to handle the load. It holds, for example, 
whenever p is sufficiently close to 1. Indeed, suppose the arrival rates (Ai), e x 
are such that p = 1— e, and consider the system with arrival rates Xi = jz^Aj. 
The basic activity tree £ will be the same for (Aj)j g z as for (Aj)j £ x- Since 
p = 1 for (Aj)j g i and CRP holds, there exists a unique set of (ipij)(ij)e£ 
that satisfies the conditions, and it has ipij > for all (ij) G 8. Also, if e is 
sufficiently small, we must have ipij ~ ipij for all (ij) G £ , and hence ipij > 
for all (ij) G £. 

Remark 8. Assumption 7 is technical - our main result, Theorem 10, 
can be proved without it, by following the approach presented in the paper. 
But, it simplifies the statements and proofs of many auxiliary results, and 
thus substantially improves the exposition. 
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Remark 9. Although the LAP equilibrium point does not achieve load 
balancing, when system is heavily loaded (i.e. p is close to 1), the load 
balancing is achieved approximately, in the sense that all queues are kept 
small and all servers are almost fully loaded - which is the best any "load 
balancer" could do (when p is close to 1). 

2.4. Basic notation. Vector i G 1), where £ can be any symbol, is 
often written as (&); similarly, j G J) = (£,-) and (ij) G £) = 
Furthermore, we often use notation r/y) to mean i G X), (77^, (ij) G 
£)), and similar notations as well. Unless specified otherwise, = 
^ieC(j)&j an d = Y^jes(i)&j- For functions (or random processes) 

i > 0) we often write £(•). (And similarly for functions with domain 
different from [0,oo).) So, for example, (&(•)) signifies ((£,(£), i G X), i > 0). 

The symbol ==> denotes convergence in distribution of random variables 
in the Euclidean space M. d (with appropriate dimension d). The symbol — > 
denotes ordinary convergence in M. d . Standard Euclidean norm of a vector 
x G is denoted \x\, while denotes the Li-norm (sum of absolute values 
of the components); u.o.c. means uniform on compact sets convergence of 
functions, with the domain defined explicitly or by the context. For i£l, 
\x\ is the greatest integer less than or equal to x. 

3. Main result. We are now in position to state our main result. 

Theorem 10. Consider the sequence of systems under LAP policy, in 
the scaling regime and under the assumptions specified in Section 2, with 
p < 1. Then: 

(i) For all sufficiently large r, the system is stable, i.e. the countable Markov 
chain Q\ (•)) is positive recurrent. 

(ii) For any e > 0, the stationary distribution of r~ 1 l 2 ~ e (W i j(-) — ^*,-r, Q\{-)) 
weakly converges to 0. 

The proof is given in the rest of the paper, and consists roughly of two 
stages. First, we study the process under the fluid scaling r~ (^L(-)i ("))> 
which allows us to prove stability and statement (ii) for e = 1/2. Then we 
need a more detailed analysis, involving hydrodynamic and local-fluid scaling 
of the process, to prove (ii) for any e > 0. 

Throughout the paper, we will use the following additional notation for 
the system variables. For a system with parameter r, we denote: 
XT(t) = + Qi CO is t ne total number of customers of type i in the 

system at time t; 

A$(t) is the total number of customers of type i exogenous arrivals into the 
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system in interval [0,t]; 

D^j(t) is the total number of customers of type i that completed the service 
in pool j (and departed the system) in interval [0, t]; 

H^-(i) is the total number of customers of type i that entered service in pool 
j in interval [0,t]. 

There are some obvious relations between realizations of these processes: 



Q r i(t) = Qr(0) + A\{t) - J2j ^(t); Q r S) > implies £\, %j(t) = PjT for 



each j G <5(i); and so on. 

We can and will assume that a random realization of the system with 
parameter r is determined by its initial state and realizations of "driv- 
ing" unit-rate, mutually independent, Poisson process nj a) (-),i G X and 
Hi? (•)»(#) as follows: 



the driving Poisson processes are common for all r. 

4. Fluid scaling. We begin by analyzing the LAP discipline on the 
fluid scale. Namely, consider the scaling 



Proposition 11. Suppose (V^(0),g[(0)) ->■ (^;j(0),%(0)). Then, w.p.l, 
for any subsequence r — > oo there exists a further subsequence along which 
(V'ijCO) x i (")> a i ("); dlj(-), converges uniformly on compact sets to 

a set tyij('),qi(-),Xi{-),cii(-),dij(')>(iij(')) of Lipschitz continuous functions 
satisfying conditions (4). The conditions involving derivatives are to be sat- 
isfied at all regular points of the limiting set of functions. (A time point 
t > is regular if both minimum and maximum of any subset of component 
functions have derivatives at t. All points t > are regular, except a subset 
of zero Lebesgue measure.) 

The fluid model conditions are: 



(4a) 9i (t) > 0, Vi e X; ^(t) > 0, V(ij) G £; Yl ^(*) ^ V -? e ^ 



(4b) ai(t) = Xit, Mi el; dij(t) = / mjTpij(s)ds, V(ij) G £ 





J (i) , (t) , xf (t) , ^-(t) , ^ (*) , H& (t)) . 
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(4c) qi(t)=qi(0) + ai{t)-^2^j{t), Vi G X; 

3 

Aj{t) = i>ij(0) + fo(t) - dij(t), V(ij) G £ 
(4d) Xi(t) = qi(t) + y2-i/jij(t) = Xi(0) + \it-S2 Vij4>ij(.s)ds, G X 

(4e) y~] (t) = whenever &/(<) > for at least one i' G C(j) 



(4f) ^£jj(*) = 0, whenever qv(i) > for at least one i! G C(j), i' < i 
(4g) ^Cij(t) = 0, whenever ^V'fcj'W < f° r at least one (if) < (ij) 



( 4h ) ^» 3 w = 5]w'j^w- Yl ^&'(*)' 

i' (i'j)<(ij) 
whenever qrj(i) > (and then necessarily ^^^fcj(i) = /3j) 



(4i) ffij(t) = \- £ l^'W' 

(ij'Xij) 

whenever ^^(t) < (and then necessarily Qj(t) = 0) 



jfijit) = min A * " S Jt^'^ 1 

\ (ij'Xij) 
V (i'j)<(ij) 



whenever qi(t) = and ^^fcj(t) = 
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Proof of Proposition 11. The proof of convergence fact and of the 
basic conditions (4a)- (4d) of the limit, is very standard. Indeed, it follows 
from the Functional Strong Law of Large Numbers (FSLLN) for the driving 
processes, and the scaling applied, that w.p.l each component function is 
asymptotically Lipschitz. For example, for each scaled departure process we 
have: w.p.l, for a fixed large C > and any < t\ < t<i < oo, 

limsupd-jf^) - dijih) < C(t 2 ~h). 

This implies that, w.p.l any subsequence of r has a further subsequence 
along which a u.o.c. convergence dL(-) —> <%(•) holds, where dij(-) is Lips- 
chitz. Similar convergence property holds for each arrival process. From here 
we obtain the convergence (along a subsequence) for all other components. 
Then, relations (4a)- (4d) are inherited from the corresponding conservation 
laws for the pre-limit trajectories. 

Properties (4e)-(4i) easily follow from the priority rule of LAP; it suffices 
to consider the behavior of pre-limit trajectories in a small time interval 
[t,t + 5] when r sufficiently large. (See e.g. [5, Theorem 1] for this type of 
argument.) 

Finally, to show (4j) we recall that, by the priority assignment procedure, 
for the activity (ij): either (ij) has the lowest priority among activities asso- 
ciated with customer class i or (ij) has the lowest priority among activities 
associated with server pool j (or both). Taking into account that point t is 
regular (which in particular implies q'^t) = and (d/dt) ^ fc ipkj(t) = 0), we 
easily see that in the former case the only possibility is that 

(ij')<(ij) >' (i'j)<{ij) 

and in the latter case we must have 

(i'j)<(ij) (»i')<(y) 

This implies (4j). We omit further details of the proof which are, again, 
rather standard. □ 

We call any Lipschitz solution (ipij (•), qi(-), Xi(-), a«(-), dij(-),£,ij(-)) of (4) a 
fluid model of the system with initial state (^ (0), %(0)); a set (ipij(-),qi(-)), 
which is a projection of a fluid model we often call a fluid model as well. 
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Remark 12. It will not be important for the results in the paper whether 
the fluid model with given initial conditions is unique; all that will matter 
is the long-term behavior of all fluid models with given initial conditions. 

Proposition 13. For any e' > and any K > there exists a finite 
time T = T(K) such that all fluid models whose starting state satisfies 
(ipij(0),qi(0))\ < K have = Vj < J, qi {t) = 0, Vi G I, and 

ipij(t) - < e' for all (ij) G £, for all t > T{K). 

Sketch of proof. For the highest priority activity (lj) there are two 
cases. 

Case a: Type 1 is a leaf. In this case j is the unique server to which type 
1 jobs are allowed to go, and they have the highest priority there. Pick a 
small 5 > 0. After a finite time (uniformly bounded above, across all starting 
states as in the proposition statement), the condition ipij(t) > ip^j — 5 must 
hold, because ipij(t) < V*j ~~ S implies that (d/dt)ipij(t) is positive and 
bounded away from 0. After such time, qi(t) > implies X^VVj'W = Pj 
and (recall that 5 is small) Ai < /J>ijipij(t) — 5i for some <5i > 0; and therefore 
(d/dt)qi(t) < —Si. We conclude that after a finite time (uniformly bounded 
above) we must have qi(t) = 0. This in turn implies that (d/dt)ipij(t) is 
negative and bounded away from as long as ipij(t) > ip^j + 5. Thus, 
\ipij(t) — ipij\ < 5 and qiif) = after a bounded time. 

Case b: Pool j is a leaf. Then Assumption 7 implies ■ = f3j and Ai > /J-ijPj- 
In this case, ipij(t) = ip\j starting at some time (that is bounded uniformly 
on initial states), simply because (d/dt)ijjij(t) > X± — > as long as 

ifa(t) <0j. 

We see that, in either case a or b, for arbitrarily small 5 > 0, there exists 
Ti = Ti(5) such that |$y(t) - Vyl < 5. 

We proceed by induction on the activity priorities and, using Assumption 7, 
easily establish analogous properties for every activity (i'j 1 )- This implies 
the result. We omit details. □ 



Theorem 14. For all sufficiently large r, the LAP discipline stabilizes 
the network ( in the sense of positive recurrence of the underlying Markov 
process). Moreover, the sequence of invariant distributions of {ip\j,ql) is 
tight, and the invariant distributions converge weakly to the point mass at 
the equilibrium point. 

Before we proceed with the proof, we need the following lemma. 
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Lemma 15. There exists T\ > such that for any T2 > T\ there exists 
a sufficiently large C = CiT^) for which the following holds. For any e > 0, 

l5>(dk(T 2 ) - dr-CTO) - (r 2 - t{)\ > c I 0, 

(«) J 

as r — )• oo ; uniformly on initial states with maxj g jg[(0) > C. 

In turn, to prove this lemma, we will need to use fluid models with in- 
finite initial states. Note that we cannot appeal directly to the properties 
of "standard" fluid models defined earlier, because we require convergence 
that is uniform in all large initial states. So, we need the following version of 
a fluid limit result. We will use notation 1 = 1U {00} for the the one-point 
compactihcation of R. 

Proposition 16. Consider a sequence of fluid-scaled processes 
(V'ijC - )) Qi(')) with deterministic initial states such that (^(0), <zf(0)) = 
C'(r) — > 00 and 

(^(o),?r(o))-^(^(o), ft (o)), 

where each ^[(0) and Oj(0) is viewed as an element o/M. Partition the cus- 
tomer classes as I = X°° UX°, where qi(0) = 00 for i G T°° , and qi(0) < 00 
for i G Z°. (Necessarily, X°° is non-empty.) Then, with probability 1, any 
subsequence of trajectories has a further subsequence which converges u.o.c. 
to a fluid model, satisfying same conditions as (4), except that for alii G X°° 
the queue length qi{t) = 00, Vt > 0. Moreover, all such fluid models are such 
that, uniformly on all of them, starting at some finite time T[, all server 
pools are fully occupied: ^ Vij (*) = Pj > t >T[, Vjf . 

Proof of this result is very similar to that of Proposition 13 (and in fact 
simpler), so it is not spelled out here. We just note that Assumption 7 is 
essential in showing that all server pools are occupied after a finite time. 
Without the assumption, we could still show that the occupancy becomes 
strictly greater than at the equilibrium point, and that would be enough for 
our purposes; however, it would make Proposition 16 statement and proof 
more cumbersome. 

Proof of Lemma 15. Let us choose Ti = 2T[, where T[ is as in Propo- 
sition 16. Now, if the lemma statement would not hold, then for some fixed 
e' > we could find a sequence of systems with (?/>[• (0), (?[(())) = C'(r) — > 
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oo, such that 

limsupP \ |V^«,(T 2 ) - dTjiTx)) - (T 2 - T x )| > e' \ > 0. 
I «0 J 

This, however, is impossible because, by Proposition 16, w.p.l from any 
subsequence of r we can find a further subsequence such that: 

E^i^) = #7 uniformly in [Ti,T 2 ], Vj, 

'Ti 



4(^2) - -> /xyV* (*)<**, V(ij), 



and therefore 

r r i2 



/i2 
fHjipij(t)dt = 

W) W) _1 

77"; JTi Pj ~r ~r 



Ti pj - ^ ./ / , 

PROOF OF Theorem 14. Recall that i/j > is the workload associated 
with a single request of type i; i.e., the optimal dual variable associated with 
(lc) for type i. We consider the quantity 

i 

(where x\{t) = ^ <zf (*) + Y^ij ^ijW); the total workload of the system. We 
will argue that the quantity 

C r {t) = [W r (t)} 2 

will serve as a Lyapunov function for the rth system. Namely, the following 
property holds: there exist positive constants K, T, C\, C 2 , C3 such that, 
for all sufficiently large r, 

(5) if C r {t) > K then E[£ r (t + T) - C r {t)\C'\t)\ < -dW r {t) + C 2 
and 

(6) if C r (t) < K then E[C r (t + T) - C r {t)\C r {t)] < C 3 . 
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(The proof of (5)- (6) is given after we complete the theorem proof.) It is 
then a standard application of the Foster-Lyapunov criteria to conclude that 
for all sufficiently large r the system Markov process is positive recurrent, 
and moreover, the stationary distributions are such that EW r = ^ ViEx\ 
remains uniformly (in r) bounded. Indeed, for any fixed initial state of the 
process, consider the embedded chain at times 0, T, 2T, .... It easily follows 
(using the fact that each input flow is Poisson, and fluid scaling is applied) 
that < E£ r (nT) = E[W r (nT)} 2 < oo for all n = 0, 1, 2, . . .. Also, WLOG, 
by rechoosing if necessary C\ and C2, we can assume that the "then" part 
of (5) holds for any C r (t). We see that, for any n, 

EC r ((n + 1)T) - E£ r (nT) < -CiEW r (nT) + C 2 \ 

from here the positive recurrence and steady-state bound EW r < C2/C1 
easily follow, because the opposite would imply EC r {nT) — > —00 as n — > 00. 

Uniform bound on EW r implies tightness of invariant distributions. The 
tightness together with Proposition 13 imply that the sequence of invariant 
distributions must weakly converge to the point mass at equilibrium. 

It remains to show property (5)-(6). First, it is easy to see (and is a 
standard observation) that 
(7) 

VT > 0, E[W r (t + T) - W r (t)] 2 are uniformly bounded across all r and t. 

This guarantees (6) for any fixed K. To prove (5), we fix T\ > as in 
Lemma 15, and then choose a large fixed T > T\. Note that 

(minfj)(maxg[(t)) < W r (t) < (maxi/j)(Jmaxg-(f) + ^ fy); 

j 

in particular, the condition maxj e jg[(0) — > 00 in Lemma 15 is equivalent to 
VF r (0) —7- 00. If we fix a sufficiently small e' > and apply Lemma 15, we 
obtain the following fact: 

for a sufficiently large fixed K > (as a function of T), uniformly on all 
C r (0) > K and all large r, 

(8) F{W r (T) - W r (0) < 2pT l - - p){T - T x )} > 1 - e' . 

Indeed, the 2pT\ is a crude upper bound on W r (Xi) — W r (0), which holds 
with high probability (w.h.p.) for large r, since by (2) new workload arrives 
at average rate p (in the fluid-scaled system). The term — (1/2) (1— p)(T— T{) 
is an upper bound on W r (T) — W r (Ti), also holding w.h.p., because by 
Lemma 15 the average rate at which workload leaves the system is w.h.p. 
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close to 1 in [Ti,T]; and recall that p < 1. This proves (8). The RHS of 
the first inequality in (8) is negative if we choose T large enough. This, 
along with (7), implies (5), since we have the identity C{t + T) — C r (t) = 
2W r (t)(W r (t + T) - W r {t)) + [W r (t + T) — W r {t)f. □ 

5. Proof of Theorem 10(ii). 

5.1. Preliminaries. In the previous section we have shown that the pro- 
cess ($£■(•), Q[ (•)) is positive recurrent, and then has unique stationary (or 
invariant) distribution for all large r (which proved Theorem 10(i)). More- 
over, 



(9) 



0. 



Here and in the rest of the paper (vE^-, Q£) means "(^-(i), Q\{t)) in sta- 
tionary regime." 

So, we know that Theorem 10(ii) is true for e = 1/2, and our goal is to 
prove it for any e > 0. In what follows 0<e<l/2is fixed. 

From (9), for an arbitrarily small fixed 5 > 0, we can choose a positive 
function g(r) = o(r), such that, 



(10) 



r^,Ql)\<g(r)}>l-5. 



Without loss of generality, assume r~ 1 l 2 ~ e g(r) — > oo. 

We will prove that there exist positive constants C and T, such that for 
any fixed 5\ > the following holds for all sufficiently large r: 

(11) |(*r.( ) -rV*-,Ql(0)) I < g(r) implies 

(T log r) - ri>*j , Ql (T log r)) | < Cr 1 ' 2 ^} > 

This fact, along with (10), implies that for all large r, in steady-state, 

P{| (% ~ rtfj, Q\) | < Cr 1 ' 2 ^} > (1 - S)(l - 6,). 

This clearly proves Theorem 10(ii), because 5, 5\ can be chosen, and e re- 
chosen, to be arbitrarily small. So, the rest of Section 5 is the proof of (11), 
with the final part of the proof given in Section 5.4. 

We will need FSLLN-type results, which can be obtained from a strong 
approximation of Poisson processes, available e.g. in [3, Chapters 1 and 2]: 
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Proposition 17. A unit rate Poisson process II(-) and a standard Brow- 
nian motion W(-) can be constructed on a common probability space in such 
a way that the following holds. For some fixed positive constants C\, Ci, C3, 
such that VT > 1 and Vu > 

P( sup \U(t)-t-W(t)\> d log T + u) <C 2 e- QiU . 

\0<t<T J 

From here, for the unit rate Poisson processes nj a) (-) and U\f(-), driving 

exogenous arrivals and departures, we obtain the following fact. (For II^ (•), 
for example, we replace t with A^ri; T with A^rTlogr; and u with r 1//4 .) 

Proposition 18. For any fixed T > and any subsequence of r — > 00, 
we can find a further subsequence (with r increasing sufficiently fast), such 
that: 

for each i 

U\ a) {\irt)- Xtrt 



sup r- 1 ' 2 -^ 2 



0<t<Tlogr 

and for each (ij) 



sup r- 1 ! 2 ^ 2 



0, w.p.l, 



0, w.p.l. 



0<*<Tlogr 

Let F r (t) be the process of (unsealed) deviations from equilibrium; that 

is, 

F r {t) = {% j {t)-r^ j ,Q\{t))- 

Suppose we have a function h(r), such that r 1//2+e < h(r) < g(r). (The 
quantity h(r) will be the "scale" of |.F r (0)|; sometimes, we simply use h(r) = 
|F r (0)|, but not necessarily.) We will establish properties of F r (-) under two 
different scalings, called hydrodynamic and local-fluid. 

We remark that the use of multiple scalings (in addition to the "standard" 
fluid scaling) is typical in the analysis of systems in many-server asymptotic 
regime, cf. [4] and references therein. However, our hydrodynamic and local- 
fluid scalings are somewhat unusual in that the scaling factor h(r) is strictly 
"between" r and r 1 / 2 . (When h(r) = r, both local- fluid and hydrodynamic 
scalings become the standard fluid scaling; if h(r) = r 1 / 2 , the local-fluid 
scaling becomes the standard diffusion scaling.) Also, the system behavior, 
of course, depends on the control discipline, LAP in our case; and so our 
analysis of LAP under various scalings is new. Most importantly, the way we 
use these multiple scalings for the purposes of proving tightness of stationary 
distributions is novel, to the best of our knowledge. 



STEADY-STATE TIGHTNESS, PRIORITY DISCIPLINE 19 

5.2. Hydrodynamic scaling. Consider the process under the following 
scaling and centering: 

(12) (^.(t),g[(t),x[(t),ar(t),4(t),^.(t)) = 
^(r)- 1 (^((%)r-H)-r^,Ql(^(r)r-H),X[(^(r)r- 1 f)-r^^i, 

3 

A\Xh{r)r- l t),D^{h{r)r- l t)^ l3 (h(r)r-H)) . 

Theorem 19. Consider a sequence of deterministic realizations, such 
that the driving realizations satisfy FSLLN conditions, namely: 

(13) (aj(i), t > 0) -»■ (A^, f > 0), U.O.C., Vi 
(14) 

(hiry^D^h^r-H)-^ %(s)ds), t > o) -> 0, w.o.c, V(ij). 

Suppose (^-(0)^(0)) ->• (^(0),^(0)). 

Then, for any subsequence ofr there exists a further subsequence along which 
(V'ij(')) <?[(')> ^T(')) ^i(')> d-ij(-), £«(")) converges uniformly on compact sets to 
a set ( , 4'ij(-),qi(-),Xi(-),ai(-),dij(-),^ i j(-)) of Lipschitz continuous functions 
satisfying conditions (15). fT/ie conditions involving derivatives are to be 
satisfied at regular time points t > of the limiting set of functions.) 

The hydrodynamic model conditions are: 

(15a) &(*)><), VieX; X) 0, V J G ^ 

j 

(15b) ai(t) = Xit, Vi£l; dij(t) = f-iij^tjt, V(z'j') € £ 

(15c) ?<(*)= 3i(0) + 3i(*)- ?y(t), ViGX; 

^ (t) = ^ (0) + (t ) - 3y (t) , Vi E X 

(15d) x i (t) = g < (t) + 5^^ i (t) = x i (0), VieX 
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(15e) y]^ 7 -(t) = 0, whenever (£) > for at least one i' G C(j) 



(15f) = ^' whenever #$/(t) > for at least one E C(j), i' < i 

(15g) = 0> whenever ^^^^-/(t) < for at least one (if) < (ij) 

k 

( 15h ) a^(*) = £wtf'i- E 

i' {i'j)<{ij) 

whenever §(-(£) > (and then necessarily ^J^y(i) = 0) 



(15i) = £ |^,(t), 

(ij')<(ij) 

whenever ^JV'jyW < (and then necessarily q^t) = 0) 



(15j) 

\ (if)<(ij) >' (i'j)<(ij) j 

whenever = and ^^tp k j(t) = 0. 

k 

There is a clear correspondence between the hydrodynamic model and 
fluid model conditions. This is not surprising, of course, - the hydrody- 
namic limit is also an FSLLN-type limit, but on a different, finer time and 
space scale. We omit the proof of Theorem 19 - it is analogous to that of 
Proposition 11. 

We call any Lipschitz solution (^y(0>9i(0j^i(0>^(0'^y(0>£i.7'(')) °f (15) 
a hydrodynamic model (HM) of the system with initial state (ip^ (0), q^O)); 
a set (•), %(■)), which is a projection of an HM we often call a hydro- 
dynamic model as well. Also, we sometimes use shorter notations f "(■) = 

%•(•),<£(■)), 70 = (AA-),U))- 

We have the following corollary of Theorem 19 which we record for future 
reference. 
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Corollary 20. For any fixed T > 0, K > and 62 > 0, there exists 
a sufficiently small 63 > 0, such that the following holds. Uniformly on all 
\f (0)| < K and all sufficiently large r, conditions 

(16) max sup |aj*(£) — < 63, 

i [0,T] 



(17) maxsuplMO -1 ^-^)^ 1 *)-^ f^' ' %(s)ds)\ < 5 3 , 

(y) [o,t] Jo 

imply 

(18) sup |/ r (t) <5 2 , 

[0,T] 

where /(•) is a hydrodynamic model. 

Proof. Suppose not. Fix T,K,82- There must exist a sequence J3 \. 0, 
and a corresponding sequence r = r(^) f °°; such that (16), (17) and 
the convergence / (0) — > /(0) of initial states hold, but (18) fails for any 
hydrodynamic model. This, however, is impossible, because according to 
Theorem 19 (or rather its version, specialized to a finite time interval, to 
be precise) we can choose a further subsequence of r along which / (t) — > 
f(t), uniformly in [0, T], where /(•) is a hydrodynamic model starting from 

7(o). - ^ . • • . ^ 

Theorem 21. For any K > there exists a finite time T = T(K) and 
constant C = C(K) > such that all hydrodynamic models with |/(0)| < K 
satisfy the following conditions: Yli^iji^) = 0,Vj < J, §^(T) = 0, Vi G I, 
and J{t) = J(T) for all t > T; maxi> |7WI < CK. Moreover, 
(ipijiT), c[i(T)) = LiipijiQ), gj(O)), where L is a fixed linear mapping. 

Proof. Consider a fixed HM /(•). Consider the highest priority activity 
(lj). There are two possible cases: j is a leaf or 1 is a leaf. 
Case a: If j is a leaf, then ipij(t) < at all times, and ipi~(t) must increase 
at positive rate, bounded away from 0, until it reaches within a finite time. 
Thereafter, ipij(t) wm stay at 0. (The argument is very similar to Case b in 
the proof of Proposition 13.) 

Case b: If type 1 is a leaf, then qi(t) must decrease and ipij{t) increase at the 
same rate (positive, bounded away from 0), until the entire queue (if any) 
"relocates into" after that time, i^ijit) and q\{t) = will not change. 
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We see that in either case a or b, after a finite time, the highest priority 
activity (lj) can be in a sense "ignored". This allows us to proceed by 
induction on the activities, from the highest priority to the lowest, to check 
that by some finite time T (depending on K) the hydrodynamic model gets 
into a state f(T), satisfying conditions of the theorem, and will stay in this 
state for all t >T. Since all HMs are uniformly Lipschitz, we obviously have 
a uniform bound maxt>o \f(t)\ < CK for some C. 

Furthermore, since all Xi(t) do not change with time, the linear mapping 
L is as follows: L(uij,Wi) = (Qj,0) where (cij) is the unique solution to 

(19a) u ij + w i • = Vi G X 

3 3 

(19b) J>^' = 0, Vj<J 

i 

□ 

Remark 22. Examination of the proof of Theorem 21 reveals that the 
HM for any initial state is in fact unique. Moreover, with a little further 
argument, it is easy to show that an HM depends on the initial state con- 
tinuously. Furthermore, the HMs are scalable: if (f(t), t > 0) is an HM, 
then so is (f(ct)/c, t > 0) for any c > 0. From here, it is easy to find that 
the theorem statement holds for a constant C independent of K and for 
T = CK. We will not need these stronger properties in this paper. 

For future reference, note that L(uij,Wi) = (cy,0) is a function only of 
the vector (z%), where z% = wi + J2j u ij- The corresponding linear mapping 
from (zi) to (cjj), we denote L' . 

5.3. Local-fluid scaling. The process under local fluid scaling is as fol- 
lows. For each r consider 

(r ij (t)^(t))^r(t) = h(rr i F r (t). 

We will also denote = h^^X^t) = qV(t) + £V ^.(t). 

Since ^L(-) (as well as ^jj(-)) is centered before it is scaled in space, we 
in particular have (by Assumption 7) that Yli — for all j < J at all 

times t. 
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Theorem 23. Consider a sequence of deterministic realizations, such 
that the driving realizations satisfy FSLLN conditions, namely: 

(20) {h{r)~ l {A^t) - Xirt), t > 0) -»■ 0, u.o.c, Vi 

(21) (hirr^DKt) - iiij J %(s)ds), t > 0) -> 0, u.o.c, V(ij). 

Assume that the initial states converge to a fixed vector (?/£-(0), <?[(0)) — > 
(tpij(Qi), q~i(Q)). Further assume that qi(0) = 0,Vz, and $^*/>ij(0) = f or a ^ 
j < J. (In other words, (ipij(0),q~i(0)) = L(ipij(Q),q~i(0)).) Then, for any 
subsequence of r there exists a further subsequence along which 

(22) «(•)). 

where (V'w(')i * s a se ^ °/ Lipschitz functions, with initial condition 
(ipij(0),q~i(0)), satisfying (local fluid model) conditions (24). Moreover, these 
limit trajectories depend continuously on the initial state and are such that, 
uniformly on all of them, 

(23) |(fe(<),9i(*))| ^ l(^'(0),%(0))|c ie - C2t , Vt > 0, 
where c±,C2 > are fixed constants. 

The local fluid model conditions are as follows: 
(24a) qi{t) =0, Vi G 2" 



(24b) = <M°) - Z) / »ij$ij( s )ds, Vi e X 

i i j 

(24c) ^4(i) = 0, Vj<J 

i 

The J + J — 1 equations for the J + J — 1 functions (ipij(-)) can be solved se- 
quentially, in order of decreasing activity priority, since the highest unsolved- 
for priority will always correspond to either a customer-type or a server-type 
leaf of the remaining activity tree. Any Lipschitz trajectory satisfying (24) 
we will call a local fluid model (LFM). Conditions (24) reduce to a system of 
linear ODEs for (ipij(t)), which of course implies the continuous dependence 
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on initial state; the fact that each LFM converges to is easily established, 
again by induction on activities; therefore, we obtain the uniform exponen- 
tial bound (23). 

Analogously to f '{-) = (^•(•), ql(-)), we will use shorter notation /(•) = 
($#(•)>&(•))• 

Proof of Theorem 23. The non-trivial part of the proof is showing 
the Lipschitz property of the limit /(•), because it is no longer a simple con- 
sequence of the FSLLN for the driving processes (as it was for the fluid and 
hydrodynamic limits). This is because the arrival and service rates in the sys- 
tem (with index r) are 0(r), while the space is scaled down by h(r) = o(r). 
For the same reason, it is also not "automatic" that the limit queues qi(-) 
stay at 0. This difficulty is resolved as follows. Consider arbitrary number 
C 4 > ||0/>y(0))||, and the random time r(r) = min{£ | || > C 4 }. 
Then, speaking informally (the formal statements are given below), the tra- 
jectory x[(-) for each i must be "almost Lipschitz" in the interval [0,r(r)], 
with the Lipschitz constant rj = C±\\(nij)\\, because the absolute difference 
between the arrival and departure rates (scaled down by h(r)) is upper 
bounded by r/ in [0,t(t-)]; similarly, each queue length trajectory g[(-) is 
"almost Lipschitz" in [0,r(r)]. 

Formally, it is easy to show the following: if r(r) — > along some subse- 
quence, then (denoting Xj(0) = Y^j ^ij'(O)) 

(25) sup \\(xl(t)) - (5*(0))[| -»• 0, sup \M(t)) - (ft(0))|| -+ 0. 

[0,r(r)] [0,r(r)] 

If liminf r(r) > e 4 > along some subsequence, then there exists a further 
subsequence along which 

(26) (aft-)) (*>(•)), (£(■)) («(0), 

where the convergences are uniform in [0,64], and each function Xj(-) and 
qi(-) is Lipschitz with constant r/ in [0,64]. 

In the case 7~(r) — > 0, as a consequence of (25), we also must have 

(27) sup ||(^(t))-(^(0))||-^0. 

[0,r(r)] 

Indeed, if (27) does not hold, then for some fixed 5 > 0, we can choose a 
subsequence of r and a corresponding sequence of times n(r) G [0, r(r)] such 
that ||(^-(*))-(^tj(0))|| > 5 for the first time. Fix T > (we will specify the 
choice later), and consider the sequence of times ri(r) — Th{r)/r. Suppose 
first that Ti(r) — Th(r)/r > for infinitely many r; then, we consider a 
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further subsequence along which this holds, and the trajectory on the time 
interval [ri(r) — Th(r)/r,Ti(r)]. Now, if we reset the time origin to T\(r) — 
Th{r)/r and "stretch" the interval [ri(r) — Th(r) /r,Ti(r)] by the factor 
r/h(r), we will obtain hydrodynamic-scaled trajectories in the interval [0, T\. 
We then choose a further subsequence of r along which these trajectories 
converge (u.o.c.) to an HM. This HM /(•) will be such that (xi(Q)) = (^(0)), 
||(^.(0)) - (4(0))|| < 6, all qM = 0, and ||(^(T)) - (0))|| > 5. We 
now specify the choice of T: it is large enough so that (^ -(T)) = L'(xi(0)). 
But, (xi(0)) = (xi(0)), which means (^(T)) = L'(xi(0)) = (^(0)) - a 
contradiction. The contradiction in the case when Ti(r) — Th{r)/r < for all 
large r is obtained similarly, except for all r we use time interval [0, Th(r)/r] 
to construct a contradicting HM. Thus, we proved (27). But, this leads to a 
contradiction with the definition of r(r). We conclude that the case r(r) — > 
is in fact impossible, and we always have the case liminfr(r) > 64 > and 
(26). 

Next, in addition to (26), we show that 

(28) \f(t) - Lf(t)\ -> 0, in particular |(^(t)) - L'{x\{t))\ -»• 0, 

uniformly in [0, €4]. (This is, again, proved by contradiction. If (28) would not 
hold, we would be able to construct an HM violating the claim of Theorem 21 
that f{t) = Lf{t) must hold after a finite time. We omit details which are 
analogous to those in the proof of (27) above.) 
In [0, £4] we also have 

Xi(t) = Xi(0) - di(t), Vz, 
where the Lipschitz function di(-) is a limit (along a subsequence) of 

Ej- Jo* »iji>ij( s ) ds - 

The above properties lead to conditions (24) on the interval [0,64]. 
Namely, we formally define (ipij(-)) = L'(xi(-)), obtain the convergence 
-> ($#(■)) from (28), and then (24) follows. 

Finally, as already observed earlier, the linear ODE (24) solutions satisfy 
condition (23). In particular, each local fluid model remains bounded in 
[0,oo). This in turn allows us to conclude that by choosing a sufficiently 
large C4, the corresponding €4 can be arbitrarily large. This completes the 
proof. □ 

We will actually need a generalized version of Theorem 23. 

Theorem 24. Consider a sequence of deterministic realizations, such 
that the driving realizations satisfy (20)-(21). Assume that the initial states 
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converge to a fixed vector / r (0) — > /°(0). (We do not assume /°(0) = 
Lf°(0).) Then, for any subsequence of r there exists a further subsequence 
along which 

(29) f (•) -> /(•), 

uniformly on compact subsets of [0,oo) not containing 0, where /(•) is a 
local fluid model with initial state /(0) = L/°(0). (Recall that (23) holds for 
any LFM.) In addition, for any K > there exists C = C{K) > such that 
|/°(0)| < K implies 

(30) limsup sup \f(t)\<CK. 

r-p-oo 0<t<l 

Proof. The proof is a slight generalization of that of Theorem 23. For 
a fixed T5 > consider the interval [0, T^h(r)/r], and the corresponding 
hydrodynamic-scaled trajectories in the interval [0, T5] ; T5 is chosen large 
enough so that the hydrodynamic model reaches state /(0) = L/°(0) by time 
T5. Then, we must have f r (T^h(r)/r) — > /(0); moreover, by Theorem 21 and 
Corollary 20, 

limsup sup \f(t)\ < CK, 

r^oo 0<t<T 5 h(r)/r 

for some C, when |/°(0)| < K. 

Then, we consider local fluid scaled trajectories starting time point 
T§h(r)/r (as opposed to 0), and the rest of the proof is essentially same 
as that of Theorem 23. □ 

The following corollary is derived from Theorem 24 analogously to the 
way Corollary 20 was derived from Theorem 19. 

Corollary 25. For any fixed K > 0, there exists C = C(K) > such 
that the following holds. For any fixed T > 0, 62 > and €2 > 0, there exists 
a sufficiently small 63 > such that: uniformly on all |/ r (0)| < K and all 
sufficiently large r, conditions 

(31) max sup \h{r)~ l (A\(t) - A»rt)| < <5 3 , 

* [0,T] 

(32) max sup ^(r)" 1 (D^(t) - ^3 f %As)ds)\ < 5 3 , 

W) [0,T] Jo 
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imply 

(33) sup|f(t)|<(K + l)C7, 

[0,21 

(34) sup \f r (t)-f(t)\<5 2 , 

[«,T] 

where /(•) is a local fluid model with |/(0) — L/ r (0)| < <52- 

5.4. Proof of Theorem 10(H). We are now in position to prove (11), 
and then Theorem 10 (ii). The basic idea is to consider the process in the 
interval [0,Tlogr], subdivided into logr intervals, each being T-long. (To be 
precise, we need to consider an integer number, say [logr], of subintervals. 
This does not cause any difficulties besides making notation cumbersome.) 
Then, using the local fluid limit results, we show that, with high probability, 
in each of the T-long subintervals, the norm |_F r (f)| decreases by a factor 
#6 E (0,1), unless the norm |-F r (t)| at the beginning of the subinterval is 
below r 1//2+e - in this case |-F r (i)| will be bounded above by 3Cr 1 / 2+<E in the 
entire subinterval (where C is as in Corollary 25 with K = 2). Now, if 5q is 
small enough, so that 

(35) 4 ogr < r^+Vr, that is S 6 < e" 1 / 2 ^, 

this means that |i ?r (t)| must "dip" below r 1//2+<E at least once, and there- 
fore \F r (T log r)\ < ?>Cr l l 2+e (with high probability). We proceed with the 
details. 

Let us choose 5q > satisfying (35), and then 82 > such that 2S2 < 5q. 
Denote by \L\ the norm of the linear operator L (defined in Theorem 21), 
i.e. the maximum of absolute values of its eigenvalues. Let us choose T > 
large enough so that (see Theorem 24) \L\c\e~ C2T < 6%. 

Suppose, for each r the initial state is as in (11). To prove (11) it suffices 
to show that from any subsequence of r we can find a further subsequence, 
along which (11) holds. So, consider any fixed subsequence, and a fixed 
Si > 0. 

In each of the subintervals [(i — l)T,iT], i = 1, 2, . . . ,logr, we consider 
the process with the time origin reset to (i — 1)T and the corresponding 
initial state F r ((i - 1)T); and if \F r ((i - 1)T)| < g(r), then we set h(r) = 
max(|F r ((i - l)T)\,r l / 2+e ). (If \F r ((i - 1)T)| > g{r) we set h(r) = g{r) 
for completeness.) By Proposition 18, we can choose a further subsequence, 
with r increasing sufficiently fast, so that, w.p.l, conditions (31) and (32) 
hold for all large r, simultaneously on each of the subintervals [0, T], [T, 2T], 
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[T(logr — l),Tlogr]. We consider the corresponding local fluid scaled 
processes f r (-), with their corresponding h(r), on each of the subintervals; 
and apply Corollary 25. We see that, with probability 1, for all large r, the 
following holds for each interval [(i — 1)T, iT], 1 = 1,2,..., logr: 
if \F r ((i - 1)T)| G [r 1/2+t ,g(r)] then \F r (iT)\ < 25 2 \F r ((i - 1)T)|; 
if \F r ((i - 1)T)| < r l / 2+e then |F r (iT)| < 3CV/ 2+e . 

Since 25% < Sq we must have \F r (iT)\ < r l l 2+t for at least one i. Finally, 
we conclude that condition \F r (T\ogr)\ < 2>Cr l l 2+t must hold (w.p.l for 
all large r). This obviously implies (11). 
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